A multichannel speech/silence detector based on time delay estimation and fuzzy classification

نویسندگان

  • Francesco Beritelli
  • Salvatore Casale
  • Alfredo Cavallaro
چکیده

Discontinuous transmission based on speech/pause detection represents a valid solution to improve the spectral efficiency of new-generation wireless communication systems. In this context, robust Voice Activity Detection (VAD) algorithms are required, as traditional solutions present a high misclassification rate in the presence of the background noise typical of mobile environments. The Fuzzy Voice Activity Detector (FVAD) recently proposed in [1], shows that a valid alternative to deal with the problem of activity decision is to use methodologies like fuzzy logic. In this paper we propose a multichannel approach to activity detection using both fuzzy logic and time delay estimation. Objective and subjective tests confirm a significant improvement over traditional methods, above all in terms of a reduction in activity increase for non stationary noise.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Noise Estimation based on Entropy without using VAD for Speech Enhancement

A practical speech enhancement system consists of two major components, the estimation of noise power spectrum, and the estimation of speech.In single channel speech enhancement systems, most algorithms require an estimation of average noise spectrum since a secondary channel is not available. This requires a reliable speech/silence detector. Thus the speech/silence detection can be a determini...

متن کامل

Time Delay Compensation for Adaptive Multichannel Speech Enhancement Systems

Several algorithms for adaptive multichannel speech enhancement have been tested in an office room and in an anechoic chamber using different noise sources. Temporal synchronisation requires time delay estimation and compensation of the desired speech signal received at different sensors.

متن کامل

Stability analysis and feedback control of T-S fuzzy hyperbolic delay model for a class of nonlinear systems with time-varying delay

In this paper, a new T-S fuzzy hyperbolic delay model for a class of nonlinear systems with time-varying delay, is presented to address the problems of stability analysis and feedback control. Fuzzy controller is designed based on the parallel distributed compensation (PDC), and with a new Lyapunov function, delay dependent asymptotic stability conditions of the closed-loop system are derived v...

متن کامل

Robust time delay estimation exploiting redundancy among multiple microphones

To find the position of an acoustic source in a room, typically, a set of relative delays among different microphone pairs needs to be determined. The generalized cross-correlation (GCC) method is the most popular to do so and is well explained in a landmark paper by Knapp and Carter. In this paper, the idea of cross-correlation coefficient between two random signals is generalized to the multi...

متن کامل

The Effect of a Voice Activity Detector on the Speech Enhancement Performance of the Binaural Multichannel Wiener Filter

A multimicrophone speech enhancement algorithm for binaural hearing aids that preserves interaural time delays was proposed recently. The algorithm is based on multichannel Wiener filtering and relies on a voice activity detector (VAD) for estimation of second-order statistics. Here, the effect of a VAD on the speech enhancement of this algorithm was evaluated using an envelopebased VAD, and th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999